AITopics | gradient tree

Collaborating Authors

gradient tree

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

An RKHS Perspective on Tree Ensembles

Dagdoug, Mehdi, Dombry, Clement, Duchamps, Jean-Jil

arXiv.org Machine LearningDec-2-2025

Random Forests and Gradient Boosting are among the most effective algorithms for supervised learning on tabular data. Both belong to the class of tree-based ensemble methods, where predictions are obtained by aggregating many randomized regression trees. In this paper, we develop a theoretical framework for analyzing such methods through Reproducing Kernel Hilbert Spaces (RKHSs) constructed on tree ensembles--more precisely, on the random partitions generated by randomized regression trees. We establish fundamental analytical properties of the resulting Random Forest kernel, including boundedness, continuity, and universality, and show that a Random Forest predictor can be characterized as the unique minimizer of a penalized empirical risk functional in this RKHS, providing a variational interpretation of ensemble learning. We further extend this perspective to the continuous-time formulation of Gradient Boosting introduced by Dombry and Duchamps (2024a,b), and demonstrate that it corresponds to a gradient flow on a Hilbert manifold induced by the Random Forest RKHS. A key feature of this framework is that both the kernel and the RKHS geometry are data-dependent, offering a theoretical explanation for the strong empirical performance of tree-based ensembles. Finally, we illustrate the practical potential of this approach by introducing a kernel principal component analysis built on the Random Forest kernel, which enhances the interpretability of ensemble models, as well as GVI, a new geometric variable importance criterion.

kernel, partition, rkh perspective, (16 more...)

arXiv.org Machine Learning

2512.00397

Country:

North America > United States > New York > New York County > New York City (0.14)
North America > United States > California > San Francisco County > San Francisco (0.14)
Europe > France (0.04)
(4 more...)

Genre: Research Report > New Finding (0.45)

Industry: Health & Medicine > Therapeutic Area (0.67)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Ensemble Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Decision Tree Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.46)

Add feedback

Infinitesimal gradient boosting

Dombry, Clément, Duchamps, Jean-Jil

arXiv.org Machine LearningApr-26-2021

We define infinitesimal gradient boosting as a limit of the popular tree-based gradient boosting algorithm from machine learning. The limit is considered in the vanishing-learning-rate asymptotic, that is when the learning rate tends to zero and the number of gradient trees is rescaled accordingly. For this purpose, we introduce a new class of randomized regression trees bridging totally randomized trees and Extra Trees and using a softmax distribution for binary splitting. Our main result is the convergence of the associated stochastic algorithm and the characterization of the limiting procedure as the unique solution of a nonlinear ordinary differential equation in a infinite dimensional function space. Infinitesimal gradient boosting defines a smooth path in the space of continuous functions along which the training error decreases, the residuals remain centered and the total variation is well controlled.

equation, gradient, regression tree, (17 more...)

arXiv.org Machine Learning

2104.13208

Country:

North America > United States > California > San Francisco County > San Francisco (0.14)
North America > United States > New York (0.04)
Asia > Middle East > Jordan (0.04)
(3 more...)

Genre: Research Report (0.82)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Ensemble Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Mathematical & Statistical Methods (0.92)

Add feedback

agtboost: Adaptive and Automatic Gradient Tree Boosting Computations

Lunde, Berent Ånund Strømnes, Kleppe, Tore Selland

arXiv.org Machine LearningAug-28-2020

Gradient tree boosting (GTB) (Friedman 2001; Mason, Baxter, Bartlett, and Frean 1999) has risen to prominence for regression problems after the introduction of xgboost (Chen and Guestrin 2016). The GTB model is an ensemble-type model, that consist of classification and regression trees (CART) (Breiman, Friedman, Stone, and Olshen 1984) that are learned in an iterative manner. GTB models are very flexible in that they automatically learn nonlinear relationships and interaction effects. However, with the increased flexibility of GTB models comes substantial worries of overfitting. The top performing gradient tree boosting libraries, such as xgboost, LightGBM (Ke, Meng, Finley, Wang, Chen, Ma, Ye, and Liu 2017) and catboost (Dorogush, Ershov, and Gulin 2018), all come with a large number of hyperparameters available for manual tuning to constrain the complexity of the GTB models. Training of gradient tree boosting models, in general, thus require some familiarity with both the chosen package, and the data for efficient tuning and application to the problem at hand. The main focus of the hyperparameters and tuning are to solve the following problems: - The complexity of trees: What are the topology of all the different trees?

artificial intelligence, iteration, machine learning, (18 more...)

arXiv.org Machine Learning

2008.12625

Country:

Europe > Austria > Vienna (0.14)
Europe > Norway > Western Norway > Rogaland > Stavanger (0.05)
North America > United States > New York > New York County > New York City (0.04)
(2 more...)

Genre: Research Report (0.64)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Ensemble Learning (1.00)

Add feedback

An information criterion for automatic gradient tree boosting

Lunde, Berent Ånund Strømnes, Kleppe, Tore Selland, Skaug, Hans Julius

arXiv.org Machine LearningAug-13-2020

This article is motivated by the problem of selecting the functional form of trees and ensemble size in gradient tree boosting (Friedman, 2001; Mason et al., 2000). Gradient tree boosting (GTB) has become extremely popular in recent years, both in academia and industry: At present, an increase in the size of datasets, both in the number of observations and the richness of the data, or number of features, is seen. This, coupled with an exponential increase in computational power and a growing revelation and acceptance for data-driven decisions in the industry makes for an increasing interest in statistical learning (Hastie et al., 2001). For these new datasets, standard statistical methods such as generalized linear models (McCullagh and Nelder, 1989) that have a fixed learning rate due to their constrained functional form with bounded complexity, struggle in terms of predictive power, as they stop learning at certain information thresholds. The interest is therefore geared towards more flexible approaches such as ensembles of learners.

artificial intelligence, machine learning, reduction, (18 more...)

arXiv.org Machine Learning

2008.05926

Country:

Europe > Norway > Western Norway > Rogaland > Stavanger (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
North America > United States > New York > New York County > New York City (0.04)
(2 more...)

Genre: Research Report (0.82)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Ensemble Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.46)

Add feedback

A Fast Sampling Gradient Tree Boosting Framework

Zhou, Daniel Chao, Jin, Zhongming, Zhang, Tong

arXiv.org Machine LearningNov-20-2019

As an adaptive, interpretable, robust, and accurate meta-algorithm for arbitrary differentiable loss functions, gradient tree boosting is one of the most popular machine learning techniques, though the computational expensiveness severely limits its usage. Stochastic gradient boosting could be adopted to accelerates gradient boosting by uniformly sampling training instances, but its estimator could introduce a high variance. This situation arises motivation for us to optimize gradient tree boosting. We combine gradient tree boosting with importance sampling, which achieves better performance by reducing the stochastic variance. Furthermore, we use a regularizer to improve the diagonal approximation in the Newton step of gradient boosting. The theoretical analysis supports that our strategies achieve a linear convergence rate on logistic loss. Empirical results show that our algorithm achieves a 2.5x--18x acceleration on two different gradient boosting algorithms (LogitBoost and LambdaMART) without appreciable performance loss.

algorithm, gradient, gradient tree, (16 more...)

arXiv.org Machine Learning

1911.0882

Country: North America > United States (0.04)

Genre: Research Report > New Finding (0.48)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Ensemble Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Gradient Descent (0.36)

Add feedback

Fair Adversarial Gradient Tree Boosting

Grari, Vincent, Ruf, Boris, Lamprier, Sylvain, Detyniecki, Marcin

arXiv.org Artificial IntelligenceNov-18-2019

--Fair classification has become an important topic in machine learning research. While most bias mitigation strategies focus on neural networks, we noticed a lack of work on fair classifiers based on decision trees even though they have proven very efficient. In an up-to-date comparison of state-of- the-art classification algorithms in tabular data, tree boosting outperforms deep learning [1]. For this reason, we have developed a novel approach of adversarial gradient tree boosting. The objective of the algorithm is to predict the output Y with gradient tree boosting while minimizing the ability of an adversarial neural network to predict the sensitive attribute S . The approach incorporates at each iteration the gradient of the neural network directly in the gradient tree boosting. We empirically assess our approach on 4 popular data sets and compare against state-of- the-art algorithms. The results show that our algorithm achieves a higher accuracy while obtaining the same level of fairness, as measured using a set of different common fairness definitions. I NTRODUCTION Machine learning models are increasingly used in decision making processes. In many fields of application, they generally deliver superior performance compared with conventional, deterministic algorithms. However, those models are mostly black boxes which are hard, if not impossible, to interpret.

algorithm, classifier, fairness, (15 more...)

arXiv.org Artificial Intelligence

1911.05369

Country:

Europe > France > Île-de-France > Paris > Paris (0.04)
North America > United States (0.04)

Genre:

Research Report > New Finding (0.48)
Research Report > Promising Solution (0.48)
Overview > Innovation (0.34)

Industry:

Information Technology > Security & Privacy (1.00)
Law (0.93)
Banking & Finance (0.68)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Ensemble Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.94)

Add feedback

Algorithm Helps Sensors on Parkinson's Patients Measure Tremor Severity in Daily Life, Study Says

#artificialintelligenceNov-4-2019, 18:52:26 GMT

Researchers have developed algorithms that work with wearable sensors to continuously monitor tremor, and estimate total tremor, in Parkinson's patients as they go about their daily routines. Analyses of sensor results using one algorithm, in particular, were similar to an established test assessing tremor without being dependent on the time the test is given. The study, "Wearable Sensors for Estimation of Parkinsonian Tremor Severity during Free Body Movements," was published in Sensors. Resting tremor, or the rhythmic shaking of muscles while relaxed, is among the motor symptoms of Parkinson's disease (PD), and some patients also have active tremor, or shaking while engaged in voluntary muscle movement. Others motor symptoms are slowness of movement (bradykinesia), rigidity, and problems with posture, balance, and gait.

algorithm, parkinson, tremor, (12 more...)

#artificialintelligence

Genre: Research Report > New Finding (0.31)

Industry:

Health & Medicine > Therapeutic Area > Neurology > Parkinson's Disease (1.00)
Health & Medicine > Therapeutic Area > Musculoskeletal (1.00)

Technology:

Information Technology > Human Computer Interaction > Interfaces (0.85)
Information Technology > Hardware (0.85)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.34)

Add feedback

Synced Tree Boosting With XGBoost – Why Does XGBoost Win "Every" Machine Learning Competition?

#artificialintelligenceApr-8-2018, 02:57:27 GMT

Tree boosting has empirically proven to be efficient for predictive mining for both classification and regression. For many years, MART (multiple additive regression trees) has been the tree boosting method of choice. But a starting from 2015, a first to try, always winning algorithm surged to the surface: XGBoost. This algorithm re-implements the tree boosting and gained popularity by winning Kaggle and other data science competition. In the thesis Tree Boosting With XGBoost – Why Does XGBoost Win "Every" Machine Learning Competition, the author Didrik Nielsen from Norwegian University of Science and Technology is trying to: The paper introduce in first place the supervised learning task and discuss the model selection techniques.

algorithm, tree model, xgboost, (13 more...)

#artificialintelligence

Country: North America > United States > New York > New York County > New York City (0.04)

Genre: Contests & Prizes (0.34)

Industry: Education (0.34)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Ensemble Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.49)

Add feedback

Accelerated Gradient Boosting

Biau, Gérard, Cadre, Benoît, Rouvìère, Laurent

arXiv.org Machine LearningMar-6-2018

Gradient tree boosting is a prediction algorithm that sequentially produces a model in the form of linear combinations of decision trees, by solving an infinite-dimensional optimization problem. We combine gradient boosting and Nesterov's accelerated descent to design a new algorithm, which we call AGB (for Accelerated Gradient Boosting). Substantial numerical evidence is provided on both synthetic and real-life data sets to assess the excellent performance of the method in a large variety of prediction problems. It is empirically shown that AGB is much less sensitive to the shrinkage parameter and outputs predictors that are considerably more sparse in the number of trees, while retaining the exceptional performance of gradient boosting.

algorithm, artificial intelligence, machine learning, (17 more...)

arXiv.org Machine Learning

1803.02042

Country: North America > United States > California (0.28)

Genre: Research Report (0.82)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Ensemble Learning (1.00)

Add feedback

Tree Boosting With XGBoost -- Why Does XGBoost Win "Every" Machine Learning Competition?

@machinelearnbotNov-30-2017, 18:43:07 GMT

Tree boosting has empirically proven to be efficient for predictive mining for both classification and regression. For many years, MART (multiple additive regression trees) has been the tree boosting method of choice. But a starting from 2015, a first to try, always winning algorithm surged to the surface: XGBoost. This algorithm re-implements the tree boosting and gained popularity by winning Kaggle and other data science competition. The paper introduce in first place the supervised learning task and discuss the model selection techniques.

artificial intelligence, machine learning, xgboost, (16 more...)

@machinelearnbot

Country: North America > United States > New York (0.14)

Genre: Contests & Prizes (0.34)

Industry: Education (0.34)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Ensemble Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.49)

Add feedback